ON THE ERDOS-STRAUS CONJECTURE 



EUGEN J. lONASCU AND ANDREW WILSON 

Abstract. Paul Erdos conjectured that for every n G N, n > 2, there exist a, b, c natural numbers, 
not necessarily distinct, so that ^ = \ + \ + \ (see [3]/ In this paper we prove an extension of 
Mordell's theorem and formulate a conjecture which is stronger than Erdos' conjecture. 

o 

^ • 1. INTRODUCTION 



The subject of Egyptian fractions (fractions with numerator equal to one and a positive integer 
as its denominator) has incited the minds of many people going back for more than three millennia 
, and continues to interest mathematicians to this day. For instance, the table of decompositions of 

■ fractions 2k+i ^ sum of two, three, or four unit fractions found in the Rhind papyrus has been 

the matter of wander and stirred controversy for some time between the historians. Recently, in 
[1], the author proposes a definite answer and a full explanation of the way the decompositions 
were produced. Our interest in this subjected started with finding decompositions with only a few 
> : unit fractions. 

' It is known that every positive rational number can be written as a finite sum of different 

unit fractions. One can verify this using the so called Fibonacci method and the formula ^ = 
, + ^^J^j^^ , n E N. For more than three fourths of the natural numbers n, ^ can be written as 

. sum of only two unit fractions: the even numbers, and the odd numbers n of the form n = Ak — 1, 

via the identities ~ I + fc(2fc-i) ' 4F^ ~ i + fc(4fc-i) ' k ^ N. It is clear that if we want to 



X 



write 

4 111 

- = - + T + -> a, 6,c G N, 
n a c 

we can just look at primes n. However, just as a curiosity, for n = 2009 = 7^(41) (multiple of four 
plus one) one needs still two unit fractions, and there are only three such representations: 

4 1 1 1111 

+ = 3777 + "777777 = T7777 + 



2009 504 144648 574 4018 588 3444 
This follows from the following characterization theorem which is well known (see [2] and [4j) 
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Theorem 1.1. Let m and n two coprime positive integers. Then 

mil 
nab 

for some positive integers a and b, if and only if there exists positive integers x and y such that 
(i) xy divides n, and 
(a) X + y = (mod m). 

In what follows we will refer to the equality 

4 111 

1 - = -+ +-. 

n a c 

and say that n has a representation as in ([1]), or that ([1]) has a solution, if there exist a, b, c 
{a < b < a) natural numbers, not necessarily different satisfying ([1]). Since ^ > ^, the smallest 
possible value of a is [n/4]. The biggest possible value of a is [^J, for instance | = | + | + 5- 
If n = 4A: + 1 (A; G N) then we can try to use the smallest value first for a, i.e. a = k + 1: 

4 _ 1 3 

^' 4k + l~ k + 1^ {k + l){4k + l)' 

Now, if the second term in the right hand side of ^ could be written as a sum of two unit 
fractions we would be done. This is not quite how the things are in general, but if we analyze the 
cases k = 3l + r with r E {0, 1, 2}, / > 0, / € Z, we see that there is only one excepted case in which 
we get stuck: k = 31. This because Theorem 1 1 . 1 1 can be used in one situation: A; = 3/ + 1 implies 
l + (3/ + l + l) = (mod 3). On the other hand, if A; = 3/+2 weget (A;+l) = (3/+2) + l = 3(/ + l) > 3 
and the second term is already a unit fraction. 

In order to simplify the statements of some of the facts in what follows we will introduce a 
notation. For every i S N let Cj be defined by 

n + 4i — 1 

d := {n\ ^ has a solution with a < }. 

It is clear that Cj C Cj+i and then Erdos-Stiaus^ conjecture is equivalent to |J Cj = N. Thus we 



obtained a pretty simple fact about the Diophantine equation ([T]): 



Proposition 1.2. The equation (QP has at least one solution for every prime number n, except 
possibly for those primes of the form n = 1 (mod 12). Moreover, 



N\Ci C {n\n = 1 {mod 12)}. 
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We observe that 12 = 2^(3), a product of a combination of the first two primes. The first prime 
that is excluded in this proposition is 13. The equality ([2]) becomes 



4 _ 1 3 
^' 12Z + 1 ~ 3/ + 1 ^ (3/ + 1)(12/ + 1) ■ 

At this point we can do another analysis modulo any other number as long we can reduce the 
number of possible situations for which we cannot say anything about the decomposition as in ([T|). 
It is easy to see that 3/ + 1 is even if / is odd and then Theorem 11.11 can be used easily with x = 1 
and y = 2. This means that we have in fact an improvement of the Proposition 11.21 

Proposition 1.3. The equation (QP has at least one solution for every prime number n, except 
possibly for those primes of the form n = 1 (mod 24)- In fact, 

N \ Ci C {n\n = 1 {mod 24)}. 

Let us observe that 24 + 1 = 5^ , 48 + 1 = 7^ , which pushes the first prime excluded by this last 
result to 73. Quite a bit of progress if we think in terms of the primes in between that have been 
taken care off, almost by miracle. 

If n = 24A; + 1, then the smallest possible value for a is 6A; + 1 and at this point let us try now 
the possibility that a = 6k + 2 = 

4 1 7 

(A) = I k gN 

24A; + 1 6A; + 2 2(3fc + l)(24A: + l)' 
In the right hand side of the second term has a bigger numerator but the denominator has 
now at least three factors. This increases the chances that Theorem 11.11 can be applied and turn 
that term into a sum of only two unit fractions. Indeed, for k = 7l + r, we get that n = 24A: + 1 = 
(mod 7) if r = 2, 2{3k + 1) + 1 = (mod 7) if r = 3, n + 1 = 2(12A; + 1) = (mod 7) if r = 4, and 
n + 2 = 24A; + 3 = (mod 7) if r = 6. Calculating the residues modulo 168 in the cases r € {0, 1, 5} 
we obtain: 

Proposition 1.4. The equation m) has at least one solution for every prime number n, except 
possibly for those primes of the form n = r (mod 168), with r G {1,52,11^}, k £ Z, k > 0. More 
precisely, 

N \ C2 C {n\n = 1, 5^ 11^ {mod 168)}. 

Let us observe that 168 = 2^ (3) (7), 168 + 1 = 13^, and the excepted residues modulo 168 are 
all perfect squares. Because of this, somehow, the first prime that is excluded by this result is 
193 = 168 + 25. Again, we have even a higher jump in the number of primes that have been taken 
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care of. As we did before there is an advantage to continue using ^ and do an analysis now on k 
modulo 5. 

For /c = 5/ + r, we have n = (mod 5) if r = 1, 3A: + 1 = (mod 5) if r = 3, and G/c + 1 = 
(mod 5) if r = 4, which puts n € C2 again. Therefore, we have for r G {0, 2} the following excepted 
residues modulo 120. 

Proposition 1.5. The equation ^^ has at least one solution for every prime number n, except 
possibly for those primes of the form n = r (mod 120), with r E {1,7^}, k G 1^, k > 0. More 
precisely, 



One can put these two propositions together and get Mordell's Theorem. 

Theorem 1.6. (Mordell [3]) The equation has at least one solution for every prime num- 
ber n, except possibly for those primes oftheformn = 840A;+r, wherer € {1,11^,13^,17^,19^, 23^}, 
A; € k >0. Moreover, we have 



Proof. By Proposition 11.41 n = I68k + 1 may be an exception but if A; = 51 + r, with 
r € {0,1,2,3,4} we have n = 1 or 7^ (mod 120) only for r € {0,1}. These two cases are the 
exceptions for both propositions and they correspond to n = 1 or 13^ (mod 840). All other 



Let us observe that 840 = 2^ (3) (5) (7) and the residues modulo 840 are all perfect squares. Not 
only that but 840+1 = 29^, 840+11^ = 31^, and 1009 = 840 + 13^ is the first prime that is excluded 
by this important theorem. While 193 is the 44-th prime number, 1009 is the 169*'' prime. It is 
natural to ask if a result of this type can be obtained for an even bigger modulo. We will introduce 
here the next natural step into this analysis, which implies to allow a be the next possible value, 
i.e. ""^^-^ , and we will be using the identities 



N \ C2 C {n\n = 1, 72 (mod 120)}. 



N \ C2 C {n\n = 1, ll^ 132, 17^ 19^, 23^ {mod 840)}. 



excepted cases are obtained the same way. 



(5) 



4 



1 11 



ken. 



120k + 1 



30A; + 3 3(10fc + l)(120A: + l) 



(6) 



4 1 11 



ken. 



120fc + 49 30A; + 15 3(5)(2A: + l)(120/c + 49) 
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2. The analysis modulo 11 

According to Proposition 1 1 . 51 we may continue to look only at the two cases modulo 120 and use 
only the two formulae above. If we continue the analysis modulo 11 in these two cases we obtain 
the following theorem. 

Theorem 2.1. The equation ([ip has at least one solution for every prime number n, except 
possibly for those primes of the form n = 1320A: + r, where 

r € {l,7^13^17^19^ 23^29^31^ 7(103), 1201, 7(127), 23(47)} :=^, /cGZ, /c>0. 

Moreover, we have 

N \ C3 C {n\n G E{mod 1320)}. 

Proof. If n = 120A; + 1 and A; = IIZ + 1, we see that n = (mod 11) and so ([5]) gives the 
desired decomposition as in ([T]) right away. If A; = IIZ + r and r S {2,4,5} the Theorem 11.11 can 
be employed to split the second term in ([5]) as a sum of two unit fractions. For instance, for r = 5 
we have 1 + 3(10(11/ + 5) + 1) = (mod 11), so one can take m = 11, x = 1 and y = 30k + 3 in 
Theorem 1 1.1[ Hence we have seven exceptions in this situation: 

• r = corresponds to n = 1 (mod 1320), 

• r = 3 gives n = 19^ (mod 1320), 

• r = 6 corresponds to n = 7(103) (mod 1320), 

• r = 7 gives n = 29^ (mod 1320), 

• r = 8 corresponds to n = 31^ (mod 1320), 

• r = 9 gives n = 23(47) (mod 1320), and finally 

• r = 10 corresponds to n = 1201 (mod 1320). 

If n = 120/c + 49 and A; = 11/ + r, then for r = 5 we have n = (mod 11). If r G {3, 5, 6, 8, 9, 10} 
we can use Theorem ll.il The exceptions then are: 

• r = corresponds to n = 7^ (mod 1320), 

• r = 1 gives n = 13^ (mod 1320), 

• r = 2 corresponds to n = 17^ (mod 1320), 

• r = 4 gives n = 23^ (mod 1320), 

• r = 7 corresponds to n = 7(127) (mod 1320). ■ 
Putting Theorem 11.41 and Theorem 12.11 together we get the following 36 exceptions: 
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12 


132 


172 


192 


232 


292 


312 


1201 


372 


412 


432 


13(157) 


472 


2521 


19(139) 


2689 


532 


3361 


592 


3529 


612 


29(149) 


672 


712 


13(397) 


732 


5569 


17(353) 


31(199) 


792 


832 


7561 


7681 


892 


8089 


8761 



The residue 1201, the first prime in this fist is not really an exception because of the following 
identity: 



(7) 



1 



+ 



1 



+ 



1 



9240A; + 1201 2310/c + 308 5(9240A; + 1)(15A; + 2) ' 770(9240/c + 1)(15A; + 2) ^ 

which shows that 9240A; + 1201 e Cs for all k e Z, k > 0. 

We checked for similar identities and found just another similar identity for the exception 
17(353) = 6001: 



1 



(8) 



9240A; + 6001 2310/fc + 1540 



1 



+ 



385(9240A; + 6001)(2034/i; + 1321) ^ 22{3k + 2)(2034A; + 1321) ' 
which shows that 9240 A; + 6001 e C40 for all € Z, A; > 0. 

Theorem 2.2. The equation fi]) has at least one solution for every prime number n, except 
possibly for those primes of the form n = r (mod 924-0) where r is one of the 34 entries in the table: 



l2 


13^ 


17^ 


192 


23^ 


292 


31^ 


37^ 


4P 


43^ 


13(157) 


47^ 


2521 


19(139) 


2689 


53^ 


3361 


592 


3529 


61^ 


29(149) 


67^ 


71^ 


13(397) 


732 


5569 


31(199) 


79^ 


832 


7561 


7681 


89^ 


8089 


8761 







Moreover, if n is not of the above form, it is in the class C3, or in Cg if n = 1201 (mod 9240), 
or in C40 ifn = 6001 (mod 9240). 

Proof. We look to see for which values of r G {0, 1, 2, 3, 4, 5, 6} we have 1320(7/ + r) + s G 
{1,52,112} (mod 168) with s E -E (i? as in Theorem 12. ip . For instance, if s = I92 we get that r 
must be in the set {0,1,3} in order to have 1320(7/ + r) + I92 G {l,52,ll2} (mod 168). These 
three cases correspond to residues I92, 4l2 and 29(149) modulo 9240. Each residue in E gives rise 
to three exceptions. We leave the rest of this analysis to the reader. ■ 
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3. Numerical Computations and Comments 

We observe that the first ten of these residues in Theorem 12.21 are all perfect squares. In fact, 
all 19 squares of primes less than 9240 and greater than 11^ are all excepted residues. There is 
something curious about the fact that all the perfect squares possible are excepted. This may be 
related with the result obtained by Schinzel in [5] who shows that identities such as d?]), ([8]) and 
others in this note, cannot exist if the residue is a perfect square. The same phenomenon is actually 
captured in Theorem 2 in |7j. The good news about Theorem 11.61 Theorem 12. !( and Theorem 12.21 
is that the first excepted residues are all perfect squares or composite and moreover their number 
is essentially increasing with the moduli. 

With our analysis unfortunately, there are a few other composite and 9 prime residues that have 
to be excluded. The prime 2521 is only the 369-th prime and it is the first prime that is excluded 
by this theorem. However, a decomposition with the smallest a possible is exhibited in the equality 

4 _ 1 1 1 

2521 ~ 636 70588 ^ 5611746' 

which puts 2521 G Cg. The other primes are in the smallest class C as follows: 



Ci 


C2 




C4 


Cs 


Ce 


3361, 7681, 8089 


3529, 5569 


8761 


2689 


7561 


2101, 2521 



Clearly, one can continue this type of analysis by adding more primes to the modulo which is 
at this point 9240. It is natural to just add the primes in order regardless if they are of the form 
4A; + 1 or 4/c + 3. We see that Erdds^ conjecture is proved to be true if one can show that the 
smallest excluded residue for a set of moduli that converges to infinity is not a prime. One way to 
accomplish this is to actually show that the pattern mentioned above continues, i.e. the number 
of excluded residues which are perfect squares or composite is essentially growing as the modulus 
increases. This is actually our conjecture that we talked about in the abstract. Numerical evidence 
points out that for residues r which are primes, we have 9240s + r G C^g^y.) with k{s,r) bounded 
as a function of s. For example, 9240s + 2521 G C12 for every s = 1.. 100000 and the distribution 
through the smaller classes is 





C2 


C3 


C4 


C5 


Ce 


C7 


Cs 


Cg 




Cii 


C12 


10852 


6444 


5332 


811 


612 


277 


63 


82 


6 


7 





5 


44.3% 


26.31% 


21.78% 


3.3% 


2.5% 


1.13% 


0.26% 


0.34% 


0.025% 


0.029% 


0% 


0.021% 



Now, if we add 13 to the factors we would have an analysis modulo 120120. It turns out that 
2521 is not an modulo 120120 exception because of the identity 
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4 _ 1 1 1 

120120fc + 2521 ~ 30030/c + 4004 ^ 1001(120120A; + 2521)(810A; + 17) 22(15A; + 2)(810fc + 17) ' 

which shows that 120120fc + 2521 G C3374 for all k e Z, k > 0. 

We found similar identities for the residues 2689, 3529, 29(149), 5569, 31(199), 7561, and 7681 
modulo 120120. This suggests that one may actually be able to obtain Mordell type results for 
bigger moduli, in the sense that the perfect squares residues appear essentially in bigger numbers, 
by implementing a finer analysis that involves higher classes than C3. It is natural to believe that 
this might be true taken into account that Vaughan [6] showed that 



— #{n e N| n < m, and ^ does not have a solution} < e ^^^^ '"-'^ G N, 

for some constant c > 0. This is saying, roughly speaking, that the proportion of the those n <m 
for which a writing with three unit fractions of 4/n goes to zero a little slower than ^ as m — > cxo. 
The first few primes that require a bigger class than the ones before are 2, 73, 1129, 1201, 21169, 
118801, 8803369,..., corresponding to classes Ci, C2, C3, C4, Cs, C15, C27, .... which shows a steep 
increase in the size of classes relative to the number of jumps. 

In [7], Yamamoto has a different approach to ours and obtains a lesser number of exceptions at 
least for the primes involved in Theorem 12.21 For each prime p of the form 4A; + 3 between 11 and 
97, there is a table in [7] of exceptions for congruency classes r [n = r (mod p) ) that is used to 
check the conjecture using a computer for al n < 10^. However, in [3], Richard Guy mentions that 
the conjecture is checked to be true for all n < 1003162753. 

We extended the search for a counterexample further for all n < 4, 146, 894, 049. For our 
computations we wrote a program that pushes the analysis for a modulus of M = 2, 762, 760 = 
2^(3)(5)(7)(11)(13)(23). The primes chosen here are optimal, in the sense that the excepted residues 
are in number less than the ones obtained by other options. The first 12 exceptions in this case are 
1, 172, 192, 292, 31^, 372, 4l2, 432, 472, 532, 3361, and 59^. The number of these exceptions was 
2299 but it is possible that our program was not optimal from this point of view. Nevertheless, 
this meant that we had to check the conjecture, on average, for every other ~ 1201 integer. The 
primes generated, 889456 of them, are classified according to the smallest class they belong to in 
the next tables: 



Ci 


C2 


C3 


C4 


C5 


Ce 


C7 


Cg 


C9 




Cii 


C12 


380547 


228230 


128494 


61129 


50853 


17116 


8459 


9580 


1836 


1386 


547 


855 


42.8% 


25.7% 


14.4% 


6.9% 


5.7% 


2% 


0.9% 


1% 


0.2% 


0.15% 


0.06% 


0.096% 
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Cl3 


Ci4 


Cl5 


Cl6 


Cl7 


Cl8 


Cl9 


C20 


C21 


C22 


C23 


C24 


115 


124 


111 


26 


10 


27 


2 


4 


4 











0.013% 


0.014% 


0.012% 


0.003% 


0.001% 


0.003% 


0.0002% 


0.00045% 


0.00045% 


0% 


0% 


0% 



C25 


C26 


C27 








1 


0% 


0% 


0.0001% 



So far, we have not seen a prime in a class Ck with A; > 27. The result obtained in [5] seems to 
imply that the minimum class index for each prime, assuming the conjecture is true, should have 
a limit superior of infinity. 
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